extract n-gram features from text